The Nature of Speech and Its Interpretation 1 

By HARVEY FLETCHER 

Introduction 



V: 



'ARIOUS phases of this subject have received serious study by 
phoneticians, otologists, and physicists. On account of its universal 
interest, it has received attention from men in many branches of 
science. In spite of the large amount of time devoted to the sub- 
ject, the progress in understanding its fundamental aspects has been 
rather slow. At the present time the physical properties which 
differentiate the various fundamental speech sounds are understood 
in only a very fragmentary way. Some very interesting and pains- 
taking work has been done on the physical analysis of vowel sounds, 
but the results to date are far from conclusive. Although several 
theories have been advanced to explain the way in which the ear 
interprets sound waves, they are still in the controversial stage. 

The material which is presented here is the result of an investi- 
gation which has been carried on in the Research Laboratories of 
the American Telephone and Telegraph Company and Western 
Electric Company during the past few years. 

To make a quantitative study of speech and hearing it is necessary 
to obtain the speech sounds at varying degrees of loudness and with 
definitely known amounts of distortion. The main reason why so 
few real results have been obtained in the investigation of speech 
sounds is due to the fact that it is extremely difficult to change the 
volume and distortion of these sounds by acoustic means. Due to 
recent developments in the electrical transmission of speech it is 
possible to produce the equivalent of these changes by electrical means. 
For this purpose a telephone system was constructed which repro- 
duced speech with practically no distortion. It was arranged so that 
by means of distortionless attenuators the volume of reproduced 
speech could be varied through a very wide range, and so that by 
introducing various kinds of electrical apparatus the transmitted 
speech wave could be distorted in definitely known ways. 

A method was developed for measuring quantitatively the ability 
of the ear to interpret the transmitted speech sounds under different 
conditions of distortion and loudness. By choosing these conditions 
properly, considerable information was gained concerning both speech 
and hearing. This indirect method of attack has a distinct advantage 

1 Presented at a meeting of the Electrical Section of the Franklin Institute held 
Thursday, March 30, 1922. Reprinted from the Journal of the Franklin Institute 
for June 1922. 
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for engineering purposes, in that it measures directly the thing of 
most interest, namely, the degrading effect upon telephone conversa- 
tion of introducing electrical distortion into the transmission circuit. 
However, the application of the results is not limited to this particular 
field. 



Method of Measuring the Quality of Speech 

Briefly stated the method consists in pronouncing detached speech 
sounds into the transmitting end of the system and having observers 
write the sounds which they hear at the receiving end. The com- 
parison of the called sounds with those observed shows the number 
and kinds of errors which are made. The per cent of the total sounds 
spoken which are correctly received is called the articulation of the 
system. 

Table I. 
Classification of the Speech Sounds. 
Pure Vowels 




Combinational and Transitional Vowels 
w — y — ou — I — h 

Semi-vowels 

1 - r 

Stop Consonants 



Voiced 
b 
d 
J 
g 
Fricative Consonants 
Voiced 



Unvoiced 

P 

t 

ch 

k 



Nasalized Formation of Stop 

m lip against lip 

n tongue against teeth 

— tongue against hard palate 

ng tongue against soft palate 



th (then) 
zh (azure) 



Unvoiced Formation of Air Outlet 

f lip to teeth 

s teeth to teeth 

th (thin) tongue to teeth 

sh tongue to hard palate 
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In order to understand the construction of the articulation lists 
and also to interpret the results of this investigation, I desire to 
give here a brief classification of the speech sounds, which is based 
upon the position of the various speech organs when the sounds are 
being produced. It is shown in the accompanying table (Table I). 

The pure vowels are arranged in the vowel triangle, which is familiar 
to phoneticians. Starting with the sound u the lips are rounded 
and there is formed a single resonant cavity in the front part of the 
mouth. Passing along the left side of the triangle from u to a the 
mouth is gradually opened with the tongue lowered to form the suc- 
cessive vowels. Going along the right side of the triangle from a to 
e, the tongue is gradually raised to the front part of the mouth form- 
ing two resonant chambers in the mouth cavity. An infinite number 
of different shadings of these vowels may be produced by placing 
the mouth in the various intermediate positions, but the ones which 
are shown were chosen as being the most distinct. 

The sounds w, y, ou, i and h are classed as combinational and 
transitional vowels. As the mouth is placed in the position to say Q 
and then suddenly changed so as to form any other vowel in the 
triangle, the result obtained is signified in writing by placing the 
letter w before the vowel. In a similar way we get the effect usually 
designated by y if the position of the vowel suddenly changes from 
e to any other vowel. An infinite variety of dipthongs can be formed 
by changing the position of the mouth necessary to form one vowel 
to that to form another without interrupting the voice. The most 
distinct and principal ones used in our language are formed by passing 
from the sound a to either extreme corner of the triangle and are 
known as ou and I. When a vowel commences a syllable it is formed 
by suddenly opening the glottis, permitting the air, which has been 
held in the lungs, to escape into the mouth, which is formed for the 
proper vowel. If the glottis remains open and the vowel is started 
by the sudden contraction of the lungs, we have the effect which is 
represented in writing by placing an h before the vowel. The sounds 
1 and r are called semi-vowels because the voice train is partially 
interrupted, although the sound can be continued. The stop and 
fricative consonants are classified in a manner which is familiar to 
phoneticians. 

It will be noticed that the markings are not those used in the inter- 
national phonetic alphabet which were entirely too complicated for 
practical use. Only the bar and accent stroke are used. These 
can be written quickly and with little chance of error. 

In order to pronounce these speech sounds properly, they must 
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be combined into syllables. For the purpose of this investigation 
they were combined into mono-syllables of the simple types con- 
sonant-vowel, vowel-consonant, and consonant-vowel-consonant. 

To eliminate memory effects every possible combination of the 
sounds into these types of syllables was used unless there was a good 
reason for excluding it. The complete list contained 8700 syllables. 
For convenience of testing these syllables were divided into groups 
of fifty. Each group contained the same kind and number of syllable 
forms and an equal number of each of the fundamental vowel and 
consonant sounds. 

Table II. 
Speech-sound Testing List. List No. 160 





Speech-sound 


Key-word 




Speech-sound 


Key-word 


1 


ha 


ho(t) 


26 


gob 


go-fb 


2 


ha 


hay 


27 


shol 


shoal 


3 


wa 


wa(g) 


28 


ros 


rus(t) 


4 


\vi 


wi(th) 


29 


jod 


ju(g)+d 


5 


vou 


vow 


30 


bok 


buck 


6 


ar 


air 


31 


zik 


z + (d)ike 


7 


ez 


e(bb)+z 


32 


l.ich 


buy-fch 


8 


fish 


you+sh 


33 


kith 


ki(te)+th 


9 


an 


on 


34 


g't 


gui(de)+t 


10 


id 


(l)id 


35 


yif 


y+if 


11 


jouv 


jow(l)+v 


36 


sin 


sin 


12 


moush 


mou(nd)-r-sh 


37 


term 


term 


13 


rour 


r+our 


38 


mgrl 


m+earl 


14 


zuth 


z-i-(s)oothe 


39 


p£rv 


p + (n)erve 


15 


hus 


who-f-s 


40 


yet 


y-j-eat 


16 


chush 


ch + (p)ush 


41 


bel 


b+eel 


17 


jum 


j + (f)oo(t)+m 


42 


zef 


ze(al)-r-f 


18 


thup 


th + (s)oo(t)+p 


43 


weng 


whe(n)+ng 


19 


fuch 


foo(t)-|-ch 


44 


kev 


k-|-ev(er) 


20 


wong 


wa(ll)-f-ng 


45 


hang 


hang 


21 


choth 


cha(lk)+th 


46 


pag 


p + (r)ag 


11 


toj 


ta(ll)+j 


47 


yas 


y+ace 


23 


kog 


k-l-aug(er) 


48 


dap 


d+ape 


24 


fon 


(tele)phone 


49 


yang 


ya(cht)+ng 


2.5 


dos 


dose 


50 


Ian 


1+on 



To illustrate the technique of articulation testing a sample list 
is given in Table II. In the first column the syllable is given in its 
phonetic form. A key-word showing how each syllable is pro- 
nounced is given in the second column. These syllables were written 
on cards which were shuffled each time before they were used, so 
that the order in which they were pronounced was entirely hap- 
hazard. One hundred and seventy-four similar lists were used in 
this work. In order to eliminate personal peculiarities, several 
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callers and observers were used. In Table III are shown the results 
obtained by an observer when this list was transmitted over a system 
which eliminated all frequencies above 1250 cycles per second. 



TRANSMISSION BRANCH 
ARTICULATION TEST RECORDING SHEET 

TITLE OF TEST 3Z03J_1 

CONDITION TESTED_£l»«_.fi3SS — Elites- izso " 
2-7-2Q A+ti„„ a t,on . S r, af >, er , ^""-^bserveR __ J1-J — 

CALLER H £-■ 



WORD 
ARTICULATION 

4-0 % 



TEST No. ' L 
UST No Lfe 



No. 


OBSERVED 


CALLED 


ERRORS 


No. 


OBSERVED 


CALLED 


ERRORS 


I 






e'r-a 


26 


zip 


th t4 p 


Hi-* 

LI —i 

t-h 
i=J - 


2 


2/7 


aft 


9.''. 


27 


Uo'd 


to'j 


3 




J : 

h/ a' 


a'-a 


28 


t,sh ... 


cMu$^ . 


" - ' _ 


4 








29 




x y^ < 




S 


cio b 






30 


\J *J 


z. a tli 


tl,-t 


6 


J 


uif 


f-s 


31 


ref 


ros 


o - c 


7 


J -/ 


m c 1 r 1 


e'r - <* 


32 




l /' 




8 


thin 




s-th 


33 


J 


ko'q 


k '} - 


9 




zTk 


k- r 


34 


J J 


1 n A 


O— a 


10 




L • 




35 


J 

tu til 


h" & .- 


h-t 

s- t% 


11 


J 


uas 


s-t 


36 


iti 






12 


— 7 


vou 


v-tt, 


37 


"1 


x^ 




13 


btp 


hrck 


ch-f> 


38 


-fnn 


,^ 




14 








39 


ko'th 


ch a'td 


eh-fc 


IS 






sh-s 


40 


rour 






16 




d o s 


%~-Zh 


41 


en 






17 




1 ^-" 




42 


hnk 


,/ 




18 


t;~ 




rr+ 


43 


>JS t 


u^~- 




19 


y 


r u 
kttU 


th-s 


44 


n' r 


a' r 


a' — o 


20 


ha 


i/ 




45 




v> sh 


"S?ru 


21 








46 




^ 




22 


del 


bsl 


b~d 


47 


J 
k o v 


peri/ 


p -k 
e'r-o 


23 


thick 


f»ch 


f-rtl 


48 


rSt 


zef 


.f-t 


24 






f inserted 


49 


1 a n 


^^ 




25 


ez 


</ 




50 


xh3l 


^—- 





Table III. 



The correct word is written opposite all of the syllables which 
were recorded incorrectly. The errors for each of the fundamental 
sounds were taken from this original sheet and recorded on an analysis 
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sheet as shown in Table IV, for example it will be noticed that p 
was recorded as k 24.4 per cent, as p 45 per cent, and as t 22.2 per cent 
of the times called. On the other hand the sound w was only recorded 
incorrectly 1 per cent of the times called. 

For this system the consonant articulation was 65.8 and the vowel 
articulation 83.4. 



Description of the System for Reproducing Speech Sounds 

The telephone system used in this investigation is probably more 
nearly perfect than any other which has yet been built. Its essential 
elements are a condenser transmitter to receive the speech waves 
and transform them into the electrical form, an amplifier for magni- 
fying the intensity of the electrical speech currents, an attenuator 
for controlling the intensity, an equalizing network, and a receiver 
for delivering the speech to the ear. A schematic arrangement ., of 
the circuit is shown in Fig. 1. 



~2M.F~~~ }W" I2W 




Equalizing 
M 2M.F.20V W.N°l. w ° rh 



'B' Battery 
ZW Volts 



'B" Battery 
240 Volts 



Fit;. 1. — High Quality Telephone System 

A detailed description of the construction and operation of the 
condenser transmitter has been given by Crandall and Wente and 
published in the Physical Reinew. 1 It is simply an air condenser, 
one of its plates being a flexible metal diaphragm. 

A five-stage vacuum tube amplifier was used. Particular care 
was taken in coupling the stages together, so that the amplifier was 
practically free from frequency distortion. 

The attenuator consisted of a potentiometer arrangement which 
could reduce the amplitude of the speech waves to approximately 
one-millionth of their maximum values. 

The equalizing network was an arrangement of resistances, con- 

> Crandall, Phys. Rev., June, 1918; Wente, Phys. Rev., July, 1917. 
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densers and inductance coils having a frequency selectivity which 
was the complement of that of the rest of the system. 

The telephone receiver was a bipolar type having a special con- 
struction which was designed to broaden the range of frequency 
response. 

The reproducing efficiency of the system from the mouth of the 
speaker to the ear of the listener for each frequency is shown in Fig. 2. 
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The pitch or frequency of the tone is given on the X axis. The 
ordinates represent amplitude ratios or the number of times the 
amplitude of the tone reaching the ear was greater than that which 
entered the transmitter. It will be seen that this high quality system 
has practically a uniform response for all frequencies throughout the 
speech range. 

In order that its uniformity may be appreciated, a comparison 
curve is given. This curve shows the deviation in the .sensitivity 
of a typical individual ear from the average sensitivity of a large 
number of ears. The ordinates represent the ratio of amplitudes 
at the various pitches which was necessary to bring the tone to the 
threshold of audibility. It is evident that this deviation is much 
larger than the departure of the high quality circuit from uniformity. 

To show that this particular individual's curve is typical, the 
curves for both ears of 20 women are given in Fig 3. For convenience 
these curves' are plotted on logarithmic paper. If an arithmetic 
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scale is used, all of the curves below the mean arc crowded together 
in the small space between zero and one, and all those above the 
mean are stretched out from one to infinity. By using a logarithmic 
plot a symmetrical distribution is obtained. The method of obtain- 




ing these ear sensitivity curves was fully described in a recent paper 2 
given before the Natural Academy of Sciences. 

■ * It is interesting to note that they indicate that each individual 
has a hearing characteristic which is quite different from other in- 
dividuals. Consequently speech sounds differently to different 
persons. Any distortions of the speech sounds will necessarily 
affect some persons differently from others. It is evident then that 
in discussing speech and hearing we must deal with statistical averages. 
Experimental articulation tests showed that the ear interpreted 
the speech which was transmitted over this high quality system 
practically as well as that transmitted through the air. Some may 
wonder why such good quality is not furnished telephone users in 
commercial practice: Scientifically speaking, it is possible to furnish 
such quality, but it is evident that the equipment involved is so com- 

2 Fletcher and Wegel, Proc. Nut. Acad. Science, Vol. 8, Xo. 1, pp. 5-6, Jan., 1922. 
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plicated that such service would be altogether too costly for com- 
mercial use; people could not afford to pay for it. 

The Relation Between the Volume and Articulation of 
Undistorted Speech 

Articulation tests were made upon the high quality telephone 
system described above when it was set to deliver various intensi- 
ties from the threshold of audibility to very large values. The 
results shown as syllable articulation values are given by the curve 




Fig. 4. 



in Fig. 4. The abscissas in this curve represent loudness and are 
expressed as the natural logarithm of the number of times the speech 
wave amplitude has been decreased from the initial intensity at 
}/l inch in front of the mouth of the callers. This unit of loudness 
has never been given a name, and as a matter of convenience in 
this work it is called a napier. It will be noticed that when the 
volume is reduced W^/i napiers below the initial speech intensity 
the articulation becomes zero. This point also represents the value 
at which the speech becomes inaudible and corresponds to approxi- 
mately 1/1000 dynes per square centimetre pressure variation against 
the ear drum. In energy units it is a reduction of ten billion times 
below the initial speech intensity. For very loud initial speech 
this point is shifted about 1 napier. For purposes of comparison 
the intensity reductions are also indicated on the loudness axis. 

At 3 napiers below or at about 1/1000 of the initial speech in- 
tensity the articulation becomes a maximum. Louder speech than 
this seems to deaden the nerves so that a person makes a less accurate 
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interpretation of the received speech. These results were obtained 
in a room which was especially constructed to exclude outside noise. 
When noise is present at the receiving station the optimum loudness 
increases as the noise increases. 

The articulation data were analyzed so as to show the errors of 
each of the fundamental sounds. The curves given in Fig. 5 show 
the results of this analysis. It will be noticed that the volume at 
which errors begin to be appreciable is different for the different 
sounds and is usually higher for the consonants than for the vowels. 
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Within the precision of the test the intersection point on the X axis 
was the same for all the sounds, namely at 11.5 napiers. 

It will be noticed that the consonants are usually harder to hear 
than the vowels. However, the speech sounds e and 1, r, ng form 
notable exceptions to this general rule, since the former is among 
the most difficult, while the latter are among the very easiest speech 
sounds. The order in which the speech sounds are given here rep- 
resents their relative difficulty of interpretation when received at 
average intensities. At all intensities, the sounds th, f and v are 
the most difficult. Z, h and s become very difficult at weak volumes. 
The sounds i, ou, er and 6 are missed less than 10 per cent of the 
time, even with "very weak" intensity. At "average" volumes 
there are only three sounds more difficult than e while at "very 
weak" volumes there are 23 sounds more difficult. At very weak 
volumes 1, which is the easiest sound at "average" volumes is missed 
three times as often as e. 
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We will now pass to a consideration of the effect of distortion 
upon the articulation of the sounds. 

Description of Electrical Filters Used to 
Produce Distortion 

In order to investigate distortion we would like to be able to take 
the train of speech waves going from the mouth to the ear and oper- 
ate upon it in various ways such as eliminating frequencies in certain 
regions without marring or disturbing other frequencies. For ex- 

Low Pass Filter 




High Pass Filter 




Fig. 6. 



ample, if all frequencies above 1000 were eliminated, it would be 
possible to determine what intelligibility is carried by this range of 
frequencies. 

Fortunately one of the recent electrical inventions is admirably 
adapted for this purpose, namely, the electrical wave filter invented 
by Dr. G. A. Campbell. This device was used extensively in this 
investigation. 

The schematic circuit diagrams of the two types of filters which 
were used are given in Fig. 6. 

This arrangement of coils and condensers produces an electrical 
conductor with the unusual properties that it transmits without 
appreciable diminution in amplitude any frequency between certain 
limits and reduces the amplitude of all frequencies outside these 
limits to less than 1/1000 of their original value. By varying the 
numerical values of the inductances and capacities this transmitted 
range can be placed at any desired position. In the arrangement 
which was used in the investigation these coils and condensers were 
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housed in two boxes. The switching mechanism was arranged so 
that by turning a dial the condensers and coils were connected in 
such a way that the filter transmitted different frequency bands. 

In Fig. 7 are shown the transmission properties of the low pass 
filter when the dial is set to transmit frequencies from to 1500. 
It is seen that for frequencies below 1400 the amplitudes of the trans- 
mitted tones are always greater than .8 of their initial values, while 
for frequencies above 1500 the amplitudes are decreased to less than 
.001 of their initial values. These electrical filters were connected 
into the high quality circuit between the third and fourth stages 

Frequency Characteristics of 0~I500 Filter 
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of the amplifier as indicated in Fig. 1. This combination formed a 
system which would pick up a complex sound wave and transmit 
faithfully to the ear those component frequencies in any desired 
region and eliminate all other frequencies. 



Results of Articulation Tests with Filter Systems 

Articulation tests were made with these filter systems and the 
results analyzed as described above. In Fig. 8 the syllable articu- 
lation results are shown in graphical form. The ordinates for the 
solid curves represent the per cent of the articulation syllables called 
into the system which were correctly recorded at the observing end. 
The abscissas represent the so-called "cut off" frequency of the 
filter. For example on the curve labelled "Articulation L" the 
point (1000, 40) means that a system which transmits only frequencies 
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below 1000 cycles per second has a syllable articulation of 40 per 
cent. Similarly on the curve labelled "Articulation H" the point 
(1000, 86) means that a system which transmits only frequencies 
above 1000 cycles per second has a syllable articulation of 86 per 
cent. The dotted curves show the per cent of the total speech energy 
which is transmitted through the filter systems used in the articula- 
tion tests. These curves are derived from the results of Crandall 
and MacKenzie which were recently published. 3 

It will be seen that although the fundamental cord tones with 
their first few harmonies carry a large portion of the speech energy, 
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they carry practically none of the speech articulation. A filter 
system which eliminates all frequencies below 500 cycles per second 
eliminates 60 per cent of the energy in speech; but only reduces the 
articulation 2 per cent. A system which eliminates frequencies 
above 1500 cycles per second eliminates only 10 per cent of the speech 
energy, but reduces the articulation 35 per cent. A system which 
eliminates all frequencies above 3000 cycles per second has as low a 
value for the articulation as one which eliminates all frequencies 
below 1000 cycles per second. This last statement may appear 
rather astonishing since it is contrary to the popular notion of the 
relative importance of various voice frequencies from an interpre- 
tation standpoint. 

The two solid curves intersect on the 1550 cycle abscissa and at 
65 per cent articulation, which shows that using only frequencies 

8 See preceding paper. 
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above or frequencies below 1550 cycles an articulation of 65 per 
cent will be obtained. The two dotted curves necessarily intersect 
at 50 per cent. 

The curves in Fig. 9 show how the articulation of some of the 
fundamental speech sounds was affected by eliminating certain 
frequency regions. The ordinate gives the number of times the 
sound was written correctly per 100 times called. As in Fig. 8 the 
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left hand curve shows the effect of eliminating all frequencies below 
and the right hand curve the effect of eliminating all frequencies 
above the frequency specified by the abscissa. 

These nine speech sounds were chosen as representing three im- 
portant classes. It is seen that the long vowels e, 1 and i can be trans- 
mitted with an error of less than 3 per cent when using either half 
of the range of frequencies. When using either frequencies from 
to 1700 or from 1700 to infinity e was interpreted correctly 98 per 
cent of the time. Similarly 1 was interpreted correctly 97 per cent 
of the time when using either the range from to 1000 or 1000 to 
infinity, and i 96 per cent of the time when using either the range 
from to 1350 or from 1350 to infinity. The short vowels, u, o and 
e are seen to have important characteristics carried by frequencies 
below 1000. More than a 20 per cent error is made on any of these 
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three sounds when frequencies below 1000 are eliminated. The 
elimination of frequencies above 2000 produces almost no effect. 

The fricative consonants s, z and th are seen to be affected very 
differently from those in the other two classes. These sounds are 
very definitely affected when frequencies above 5000 are eliminated. 
The sounds s and z are not affected by the elimination frequencies 
below 1500. It is principally due to these three sounds that the 
syllable articulation is reduced from 98 per cent to 82 per cent when 
frequencies above 2500 cycles are eliminated. 

A more detailed analysis of the articulation results on all the speech 
sounds showing the kind as well as the number of errors will be given 
in a future paper. 

Conclusion 

In conclusion then we see that the intensity of undistorted speech 
which is received by the ear can be varied from 100 times greater 
to one-millionth less than the initial speech intensity without notice- 
ably affecting its interpretation. The intensity must be reduced 
to one-ten-billionth of that initial speech intensity to reach the thres- 
hold of audibility for the average ear. Also it is seen that any ap- 
paratus designed to reproduce speech and preserve all of its char- 
acteristic qualities must transmit frequencies from 100 to above 
5000 cycles with approximately the same efficiency. Although most 
of the energy in speech is carried by frequencies below 1000, the 
essential characteristics which determine its interpretation are carried 
mostly by frequencies above 1000 cycles. In ordinary conversation 
the sounds th, f and v are the most difficult to hear and are responsi- 
ble for 50 per cent of the mistakes of interpretation. The character- 
istics of these sounds are carried principally by the very high 
frequencies. 

It is evident that progress in the knowledge of speech and hearing 
has a great human interest. It will greatly aid the linguists, the 
actors, and the medical specialists. It may lead to improved devices 
which will alleviate the handicaps of deaf and dumb persons. Fur- 
thermore this knowledge will be of great importance to the telephone 
engineer, and since the telephone is so universally used, any improve- 
ment in its quality will be for the public good. 

These humanitarian and utilitarian motives as well as the pure 
scientific interest have already attracted a number of scientists to 
this field. Now that new and powerful tools are available, it is 
expected that in the near future more will be led to pursue research 
along those lines. 



